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Abstract 

We all go through scores of web-pages everyday, in search 
of required information. At times we become satisfied with 
the content of some pages, at time we fail to. An objective 
framework that attempts to model the user satisfaction when 
he searches for some desired piece of information, is essential 
for Human-Computer-Interaction. In the present work, a sim- 
ple probabilistic model is constructed to achieve precisely the 
same. This realistic yet strictly mathematical study proposes 
a marker, the 'satisfaction retentivity quotient', to model the 
complex realm of user psychology as he attempts to find re- 
quired information and forgets some bits of it here and there, 
simultaneously. 
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We read hundreds of web-pages everyday to find information of our interest. 
Startling extent of inter connectivity [1] of the web-documents have made life 
easier for us while attempting to retrieve certain piece of information from the 
Internet. We all know how happy we become to find some bits of relevant 
information and we know how irritated we become not to find the same in 
scores of pages. While significant progress has been made in many realms of 
'Human-Computer Interaction' (HCI), a fundamental aspect of the field, namely 
an objective model that attempts to capture the states of satisfaction of a user 
as he traverses through the web in search of a particular set of information; still 
eludes the students of HCI and in general, people who use Internet. Although 
some efforts have been made to quantify certain aspects of 'user satisfaction' 
([2] studied it as a function of search engines, [3] had attempted to describe it 
with respect to traffic demand and capacity of each link in the network, still 
differently, [4] attempted to describe it as a function of information retrieval), 
a realistic picture of the changing state of satisfaction level of an Internet user, 
does not generally emerge from these works. Here, a simple probabilistic model 
is proposed, which tries to model the user psychology as he wanders around un- 
known web-sites in search of a particular set of desired information. An honest 
attempt is made to describe the process as it is, instead of assuming the user to 
perform unrealistic operations. 

Let us attempt to model a situation when a user is searching for a chunk of 
information, of somewhat fuzzy nature (because in most of the cases, user him- 
self does not possess categorical knowledge about what exactly he is searching 
for, but becomes aware of his precise needs when he starts the searching op- 
eration). Similarly, it is not exactly commonplace for the user to hit-upon a 
particular web-site where the entire bulk of information would be available to 
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him in one magical attempt. Instead, he/she gathers bits and pieces of rele- 
vant information from various web-sites. We assume that the required set of 
information can be broken down to an arbitrarily large number of small bits of 
information, which the user accumulates and arranges in his mind suitably, to 
gather the desired knowledge. Let us assume that a flux of tiny bits of pertinent 
information in the very first page of any web-site is what captures the imagi- 
nation of the user (say, U) and motivates him to read the content of the entire 
site in sufficient detail, in the hope of collecting the maximum content of infor- 
mation about the question in his(her, at any rate) mind. We assume further 
that the satisfaction level of U is purely a function of the level of information 
he is receiving from the web-site (in this work, the design features of the web- 
site that do not contribute to the existing level of information content of U, is 
not considered as a reasonable parameter which might influence the satisfaction 
level of U). We assume further that U's coming across these tiny bits of perti- 
nent information forms an elementary flow of intensity A (scaler rate parameter) . 

It is realistic to assume that the very moment U touches upon a piece of infor- 
mation that he thinks might take him closer to the desired piece of information 
he is searching for, his satisfaction level grows. To make the calculation simpler, 
we assume that this growth of satisfaction in U, takes place with a constant scale 
of unit magnitude. However, as it generally happens in real-life, very soon U 
realizes that any typically encountered tiny piece of hint of an information is not 
taking him closer to the set of information he wishes to possess, but is aiming 
at something else that aren't exactly related to the premise of question he is 
interested in. His satisfaction level therefore starts to decay. To represent the 
situation reasonably, we assume this decay function to be having an exponential 
nature (rather than having a two-state (or some other) characteristics), with a 
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parameter /i. Thus, while A is influenced predominantly by the content of the 
site; origin of [i is complex, because of its dependence on various parameters. 
Nevertheless, those tiny sources of information satisfies U to some extent and a 
cumulative effect of these acquired information starts to build up in his mind. 
We assume that the gradual growth of residual satisfaction of U (attained with 
the bits of pieces of acquired information from the web-site) over the travers- 
ing time t, through the web-page, can be captured by summing them up. We 
designate the user-satisfaction level by X(t). In the present work, a simple 
mathematical model is proposed which attempts to find the characteristics of 
this growth of user-satisfaction upon browsing through a web-site. 

At this point we segment our study in two cases. 
Case 1) : 

Let us assume that U touches upon the pertinent information at random mo- 
ments, Ti,T2, . . . ,7j, . . . , which forms an elementary flow of events. The user 
satisfaction level at any arbitrarily chosen moment t, due to his interaction with 
any particular bit of information (say, i th bit of information) encountered at the 
moment Tj, is given by : 



Hence compositely we can write, Si(t) = l(t — Tj)e , where l(t) is a unit 

function; T t > 0, t > 0. 

Let us now define a random variable fi, which describes the number of such 
tiny bits of information that influence the satisfaction level of U. This variable, 



Si(t) = 0, (*<!*) 



(1) 



e-"<*- T '>,(i >Ti) 



(2) 
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to be realistic, will be having a Poisson distribution with parameter Xt ([5-8]). 
Further, to describe the real-life situation properly, we represent the user- 
satisfaction level X(t) as the sum of random number of random terms : 



Since a Poisson flow of events on any interval (0, t) can be represented, with suf- 
ficient accuracy as a collection of points on that interval (described in the 'Sup- 
plementary Material', available on request), the coordinate of which «j € (0, i) 
is uniformly distributed on that interval and does not depend on the coordi- 
nates of other points. This is natural to expect even from a non-mathematical 
intuitive understanding of the situation also. Because the user U comes across 
all these points (representing the exact instance of finding a bit of relevant in- 
formation) during the interval (0, t) and this description exhaustively represents 
the entire event space of favorable encounter for the user during (0,t). 

Therefore, eq™-3 can be re-written in the form : 

n 

X{t) = J2 e ~ tt{t ~°' i) ( 4 ) 

i=l 

where the random variables are independent and uniformly distributed in 
the interval (0, t). 

Since the satisfaction of U is a function of interplay of A and jj,, and practi- 
cal experience suggests it to be having a cumulative nature, we can attempt 
to model user satisfaction as a resultant of each event of favorable information 
gathering and unfavorable decay. We designate Xi(t) = e - ^* - " 1 ' = e~ tlt e IJ ' ai , 



n 




(3) 
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where Xi(t) represents each of these tiny events. Hence we have : 



X(t) = £x i (t) = e-'"X><" 



(5) 




where Xi(t) are independent similarly distributed random variables, and the 
random variable Vt does not depend upon the random variables Xi(t) either. 
Here we note that although X(t) is cumulative in nature, essentially it is a 
stochastic process. 

At this moment, we invoke the known formula regarding mean value and vari- 
ance of the sum of a random number of random variables [9], [10] (if random 
variable Z is a sum Z = ^ i=1 Xi, where the random variables Xi are indepen- 
dent and have the same distribution with mean value m x and variance Var^; 
the number of terms fl is an integral random variable which does not depend 
upon terms of Xf, has a mean value mn and variance Vara; we know the mean 
value m z and variance Var z of the random variable are given by : m z — m x mn 
and Var z — Var x rriQ + m x Varn). Whereby in the present case, we have : 



Since the random variable has a Poisson distribution with parameter \t, it 
follows that mn(t) = Varn(t) = Xt. 

To find m Xi (t) : 



m x (t) = mn{t)m Xi (t) 



(6) 



and 



Var x {t) = m n {t)Var Xi (t) + Var n (t)m 2 Xt (t) 



(7) 
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rn 



Zt (t) = E[X l {t)]^\S>-^- x) ^=- 



fit 



However, it is pragmatic to assume that the process of estimation of m Xi (t) 
in user's mind to be less than smooth and therefore to describe the situation 
realistically, we need to calculate some quantity analogous to moment of iner- 
tia of m Xi (t), if in the mental space m Xi (t) is described as a line-shaped object. 
Hence we determine the second moment about the origin of the random variable 
Xi{t) : 



Hence, 



1 _ e -M* 

m x {t) = A (8) 



i _ e -2nt 

Var x (t) = \t[Var Xi (t) + m 2 Xt (t)] = XtE[Xf(t)] = A — (9) 



It is interesting to notice that as t — > oo, the mean value and variance of the 
process X(t) do not depend on time, since 



lim t ^oom x = m x = - (10) 

and 



lim t ^ooVar x (t) = Var x = (11) 
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This is expected purely from an intuitive perspective also. After traversing 
through the web-site(s) for a sufficiently long time, the user is expected to 
gather a finite amount of desirable information. However, since he fails to re- 
member all of it, only a fraction of the amassed information will be retained 
by him. Hence the fraction ^ can be named as 'satisfaction retentivity quotient'. 

If we represent a sufficiently a large number by L, then the distribution pro- 
file of the section of the stochastic process X(t) for m x = ^ > L, can be 
interesting. For this we consider a finite but sufficiently large interval (0, t) and 
assume that for some sufficiently large number of times f2, user's interaction 
with the desired piece of information takes place on that interval. For such a 
situation we see that the process X(t) (eq™-3) is a sum of independent similarly 
distributed random variables, which has an approximately normal distribution 
(since in this case the conditions of the central limit theorem are in fact ful- 
filled). Henceforth, the section of the stochastic process can be considered to be 
having a normal distribution with characteristics m x = ^ and Var x = j^. 

To understand the characteristics of user satisfaction, it is imperative to form 
an idea about the evolution of it over the time interval of his browsing through 
web-sit (s). Hence we proceed to find the correlation function between user sat- 
isfaction profiles during different instances of browsing operation. This can be 
done by considering two sections of the stochastic process in question, at the 
moments t and t'(t' > t). By virtue of the assumption made, we can assert that 
the user satisfaction X(t') at the moment t' , is equal to the extent of satisfac- 
tion X(t) at the moment t multiplied by the exponent e - ^* ~*\ added with the 
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satisfaction fl(t' — t), which comes into being due to user's coming across some 
interesting bits of information during the time interval (t' — t). Hence X(t') is 
given by : 

X(t') = [X(t)e- X(t '-^ + n(t' - t)] (12) 



The stochastic processes X{t) and Q(i' — t) are evidently independent since they 
are generated due to user's interaction with desired piece of information during 
different, non-overlapping time intervals (0,t) and (t, t') respectively. 

The same can be said about the centered stochastic processes X(t) and Cl(t' — t), 
where we define X(t) = X{t) - m x (t) and fl(t' - 1) = Q.{t' - t) - m n (t' - t) as 
centered random functions of the aforementioned stochastic processes. 
Hence, using eq™-12 we have : 



C x {t,t') = 



E X(t)X(t') 
E 



= E 
= E 



(i:(t)) 2 e-^'-^ifit' > t) 



(x(t'))' 



-n(t-t') 



if(t' < t) 



Thus the correlation can be compositely expressed as : 



C x (t,t') = Var x (min(t,t')) \l - e 2amm ^'A 



(13) 
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Let us consider the limiting behavior of the stochastic process when t — > oo, 
t' — > oo, but the magnitude of their difference r = t' — t is finite. In this case, 
C I (T) = Var x e-"M = £e-"M. 

Hence the stochastic process -X'(t) representing user satisfaction practically at- 
tains stationarity in every aspect when the user spends a long time searching 
for some desired bulk of information, which conforms to our experience. Fur- 
thermore, its nature assumes that of a normal distribution when user searches 
for long (in other words, (t — > oo), (t' — > oo)) and ^ > L. 

Of course the user can hit upon a web-site where the information regarding 
all of his interest is kept in one place. In such (unlikely) case, naturally \l — > 0. 
Here the extent of user satisfaction will be a Poisson process since every new 
piece of information that the user will be encountering will exactly match with 
the desired set of information he wanted to collate. Consequently, the decay in 
user's interest will occur minimally. In such a case, the expressions for m x (t), 
Var x (t) and C x (t,t') will assume the form : 

lim t -Hx>'m x (t) = Um^oX 1 ^^ - = lim^ Var x (t) = Um^oX 1 '^^ = Xt 
lim^ C x (t,t') = lim^ ± \l - e 2 ^ min ^] e -Ml*'— *l = X[min{t,t')} 
Case 2) : 

In some other real-life cases another situation is frequently encountered. When 
certain related pieces of information from a web-site conform to user's desired 
set of information and user comes across these related pieces of information 
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in a somewhat quantized form. Although this case is similar to one discussed 
already, there are certain subtle differences. To find the characteristics of user 
satisfaction level in this situation, we assume the applicability of the assump- 
tions made earlier and at the same time assume further that user's coming across 
such quantum of desired information form an elementary flow with intensity A. 
The exact number of information that constitute any i th quantum of desired 
information is assumed to be a random variable Ri, which, keeping with the 
real-life situation, is obviously independent of the number of information that 
constitutes any other quantum. The random variable Ri has a distribution f(R) 
with characteristics of m R and var R . 

Just like the case where user was encountering the desired information in bits 
and pieces(eq™3), here too we can represent the extent of user-satisfaction by : 

n 

X(t) = J2 R i e ~' l{t ~ ai) ( 14 ) 
»=i 

where the random variables fl, Ri and «j are mutually independent. 
Keeping with the case-1 approach, we designate Xi(t) = i?je~ A ( t_Q!i ) and then 



E[Xi(t)}=m R 



l-e~ 
fit 



and E [Xf(t)] = (Var R + m%) 



1-e 




Henceforth 



m x (t) = Xttir 



1 - 



(15) 



and 



Var x (t) = \[(Var R + m 2 R ) 



1 - e- 2 ^ 



(16) 



2(i 
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Since ttir > and Vara > 0, eq™-15 will grow faster than eq™-8, similarly 
eq™-16 will grow faster than eq™-9. This is completely in agreement with prac- 
tical experiences. Since user comes across the desired bulk of information in a 
coherent quantized form, he acquisition of knowledge becomes fast. 



This basic scheme of swiftness of knowledge gathering (obviously) doesn't change 
when the user browses for a long time and at the limiting case t — > oo, we have: 



lim t -, 00 m x (t) = ttl x = 
limt^Var^t) = Var x = A(Va ^ +m » ) 
C x {t) = Var x e-^\ 



Conclusion : 

A probabilistic model is proposed here that describes the satisfaction profile of 
a user when he browses through web-site (s) in search of a desired set of informa- 
tion. Since the results obtained from theoretical considerations seem to agree 
pretty much with our routine experiences, the reliability of this attempt can be 
considered trustworthy. The model points to a stationarity in user satisfaction 
profile when the browsing operation continues for a long time. Most impor- 
tantly, it suggests a marker, the 'satisfaction retentivity quotient' that captures 
the essence of the entire process and can help in objective description of many 
of the processes that the rapidly emerging field of HCI attempts to model. 
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